Information processing apparatus and method, and program

ABSTRACT

An information processing apparatus including a plurality of feature amount extraction parts which extract, from content, a plurality of feature amounts that contain information concerning a camera motion in a frame of the content. A display control part displays frames of the content and a GUI on the displayed content frame. The GUI corresponds to an operation concerning the camera motion on the basis of the extracted camera motion information.

CROSS REFERENCE TO RELATED APPLICATIONS

This application claims the benefit of Japanese Priority Patent Application JP 2012-257827 filed Nov. 26, 2012, the entire contents of which are incorporated herein by reference.

BACKGROUND

The present disclosure relates to an information processing apparatus and method, and a program, and particularly relates to an information processing apparatus and method, and a program which can improve operability for grasping the substance of content.

As a related art that applies a result of recognizing camera motion in order to check the substance of moving picture content, there has been a technology in which a video sequence is indexed by dividing it into segments in which the camera is moving and segments in which the camera is fixed, as described in National Publication of International Patent Application No. 2005-538584.

In addition, there has been a technology in which an index image is generated for each type of camera motion, as described in Japanese Patent Laid-Open No. 6-276467.

These technologies enable the substance of the moving picture content to be grasped by displaying the camera motion along a time line or the like on a preview screen for checking the moving picture content for a certain duration.

SUMMARY

However, there has not been proposed a technology in which a user may use the camera motion to operate a preview of the content in the preview screen.

The present disclosure is made in consideration of such a situation, and improves operability for grasping the substance of content.

According to an embodiment of the present disclosure, there is provided an information processing apparatus including a plurality of feature amount extraction parts configured to extract, from content, a plurality of feature amounts containing information concerning a camera motion in a frame, and a display control part configured to display a GUI corresponding to an operation concerning the camera motion in the frame on the frame in the content on the basis of the information concerning the camera motion extracted by the feature amount extraction parts.

The information processing apparatus may further include a space recognition part configured to recognize a relationship in a space between frames using the information concerning the camera motion and an image feature amount of the plurality of feature amounts. The display control part may display the GUI corresponding to the operation concerning the camera motion in the frame on the frame in the content on the basis of the information concerning the camera motion extracted by the feature amount extraction parts and the relationship in the space between the frames recognized by the space recognition part.

The GUI corresponding to the operation concerning the camera motion may be a GUI enabling a camera angle to be switched to another angle.

The display control part may switch a display of the frame to a display of a frame of an angle corresponding to an operation on the GUI by a user.

The GUI corresponding to the operation concerning the camera motion may be a GUI enabling a zoom-in or zoom-out operation.

The display control part may switch a display of the frame to a display of a frame of zoom-in or zoom-out corresponding to an operation on the GUI by a user.

According to an embodiment of the present disclosure, there is provided an information processing method including extracting, by an information processing apparatus, from content, a plurality of feature amounts containing information concerning a camera motion in a frame, and displaying, by the information processing apparatus, a GUI corresponding to an operation concerning the camera motion in the frame on the frame in the content on the basis of the extracted information concerning the camera motion.

According to an embodiment of the present disclosure, there is provided a program causing a computer to function as a plurality of feature amount extraction parts configured to extract, from content, a plurality of feature amounts containing information concerning a camera motion in a frame, and a display control part configured to display a GUI corresponding to an operation concerning the camera motion in the frame on the frame in the content on the basis of the information concerning the camera motion extracted by the feature amount extraction parts.

According to one embodiment of the present disclosure, a plurality of feature amounts containing information concerning a camera motion for each frame are extracted from content. Then, a GUI corresponding to an operation concerning the camera motion for the frame is displayed on the frame in the content on the basis of the extracted information concerning the camera motion.

According to an embodiment of the present disclosure, operability for grasping the substance of content may be improved.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a diagram showing a configuration example of an information processing apparatus applying the present technology;

FIG. 2 is a flowchart illustrating a content input process of the information processing apparatus;

FIG. 3 is a flowchart illustrating a preview display process;

FIG. 4 is a flowchart illustrating a preview reproduction process;

FIG. 5 is a diagram showing an example of a preview screen;

FIG. 6 is a diagram showing an example of a scene with a camera being panned rightward;

FIG. 7 is a diagram showing an example of a preview display section in the case of FIG. 6;

FIG. 8 is a diagram showing an example of a scene in which a camera zooms out;

FIG. 9 is a diagram showing an example of a preview display section in the case of FIG. 8;

FIG. 10 is a diagram showing an example of the preview display section in the case of FIG. 8;

FIG. 11 is a diagram showing an example of the preview display section in the case of FIG. 8;

FIG. 12 is a diagram showing an example of a scene shot with repeated pan/tilt by a camera;

FIG. 13 is a diagram showing an example of space recognition in the case of FIG. 12;

FIG. 14 is a diagram showing an example of a preview display section in the case of FIG. 12; and

FIG. 15 is a block diagram showing a configuration example of a computer.

DETAILED DESCRIPTION OF THE EMBODIMENT(S)

Hereinafter, preferred embodiments of the present disclosure will be described in detail with reference to the appended drawings. Note that, in this specification and the appended drawings, structural elements that have substantially the same function and structure are denoted with the same reference numerals, and repeated explanation of these structural elements is omitted.

Hereinafter, a description will be given of an embodiment for carrying out the present disclosure (referred to as embodiment below). The description is given in the order as follows.

1. First embodiment (information processing apparatus)

2. Second embodiment (computer)

<1. First embodiment>

[Information processing apparatus configuration of present technology]

FIG. 1 is a diagram showing a configuration example of an information processing apparatus applying the present technology.

An information processing apparatus 11 shown in FIG. 1 displays feature amounts of content extracted from the content by way of a recognition technology such as image recognition, speech recognition, and character recognition in a screen for previewing the content along a time line. Further, the information processing apparatus 11, in the case of pausing the content being previewed, displays on the paused frame (image) a GUI corresponding to an operation concerning the camera motion for the frame. The information processing apparatus 11 is constituted by a personal computer, for example.

In an example of FIG. 1, the information processing apparatus 11 is configured to include a content input part 21, content archive 22, feature amount extraction parts 23-1 to 23-3, content feature amount database 24, display control part 25, operation input part 26, display part 27, feature amount extraction part 28, search part 29, and space recognition part 30.

The content input part 21 receives content from the outside not shown or the like and supplies the received content to the feature amount extraction parts 23-1 to 23-3. Additionally, the content input part 21 registers the received content in the content archive 22.

The content archive 22 has the content registered therein from the content input part 21.

The feature amount extraction parts 23-1 to 23-3 perform the image recognition, speech recognition, character recognition and the like on the content to extract each of a plurality of feature amounts including an image feature amount, speech feature amount and the like. For example, the feature amount extraction part 23-3 extracts, in particular, information concerning the camera motion (hereafter referred to as camera motion information) as one of the feature amounts from the content. The camera motion information includes information on the camera's motion such as pan and tilt, and information on the motion of the camera's lens such as zoom-in and zoom-out. As the camera motion information, information from a sensor sensing the camera motion at the time of shooting the content or the like can also be used.
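
Purely as an illustration, the following is a minimal sketch of how such camera motion information might be estimated from the video itself when no sensor data is available, using dense optical flow between consecutive frames. The thresholds, label names, and the use of OpenCV's Farneback flow are assumptions made for this sketch, not details taken from the present disclosure.

```python
import cv2
import numpy as np

def extract_camera_motion(video_path, pan_thresh=1.0, zoom_thresh=0.1):
    """Return one hypothetical camera-motion label per frame transition."""
    cap = cv2.VideoCapture(video_path)
    ok, prev = cap.read()
    if not ok:
        raise ValueError("could not read video: %s" % video_path)
    prev_gray = cv2.cvtColor(prev, cv2.COLOR_BGR2GRAY)
    motions = []
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        gray = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)
        flow = cv2.calcOpticalFlowFarneback(prev_gray, gray, None,
                                            0.5, 3, 15, 3, 5, 1.2, 0)
        h, w = gray.shape
        dx, dy = flow[..., 0].mean(), flow[..., 1].mean()
        # Radial flow component: on average positive when zooming in,
        # negative when zooming out.
        ys, xs = np.mgrid[0:h, 0:w]
        rx, ry = xs - w / 2.0, ys - h / 2.0
        norm = np.sqrt(rx ** 2 + ry ** 2) + 1e-6
        radial = ((flow[..., 0] * rx + flow[..., 1] * ry) / norm).mean()
        if radial > zoom_thresh:
            motions.append("zoom-in")
        elif radial < -zoom_thresh:
            motions.append("zoom-out")
        elif abs(dx) > pan_thresh:
            # The scene flows leftward (negative dx) when the camera pans right.
            motions.append("pan-right" if dx < 0 else "pan-left")
        elif abs(dy) > pan_thresh:
            motions.append("tilt-down" if dy < 0 else "tilt-up")
        else:
            motions.append("fixed")
        prev_gray = gray
    cap.release()
    return motions
```

The sign conventions (flow direction versus camera direction) and the single-threshold classification are deliberate simplifications for readability.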

The feature amount extraction parts 23-1 to 23-3 register the extracted feature amounts of the content in the content feature amount database 24. Here, three feature amount extraction parts 23-1 to 23-3 are shown, but the number thereof is not limited to three and varies depending on the types (number) of the extracted feature amounts. Hereinafter, the feature amount extraction parts 23-1 to 23-3, when not necessary to be distinguished from each other, are merely referred to as the feature amount extraction part 23.

The content feature amount database 24 has the feature amounts of the content extracted by the feature amount extraction part 23 registered therein.

The display control part 25 retrieves, in response to a user instruction from the operation input part 26, content to be previewed and a feature amount of the content from the content archive 22 and the content feature amount database 24, respectively. The display control part 25 generates a preview screen on the basis of a preview image of the retrieved content and the information concerning the feature amount of the content, and controls the display part 27 to display the generated preview screen. In displaying the preview screen, the display control part 25, when supplying text or image information for which an instruction is issued by the user via the operation input part 26 to the feature amount extraction part 28, receives a search result supplied in response thereto from the search part 29. The display control part 25 displays the preview screen on the basis of the search result.

Further, the display control part 25, in displaying the preview screen, when a moving picture being preview-reproduced is paused by a user operation on the operation input part 26, determines whether or not it is possible to move a viewpoint or angle of the camera from the paused image (frame) (that is, to perform an operation concerning the camera motion) on the basis of the camera motion information and space information in the content feature amount database 24. If the viewpoint or angle of the camera can be moved from the paused image, the display control part 25 displays the GUI (Graphical User Interface) superimposed on the image, the GUI corresponding to the operation concerning the camera motion and including a button for enabling the movement (operation) of the viewpoint or angle of the camera and the like. In the case where the button is pushed by the user operation on the operation input part 26, the display control part 25 switches the display to an image of a different angle corresponding to the pushed button, and moves a line indicating a reproducing position in the time line to a position of the switched image.
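
As a hedged sketch only, the decision of which buttons to offer at the paused frame could be driven by per-frame motion labels such as those produced by the extraction sketch above: motion before or after the paused frame indicates whether the surrounding space exists somewhere in the content. The label strings and button identifiers below are hypothetical, and the check over the whole clip (rather than the current continuous shot) is a simplification.

```python
def available_operations(motions, paused_index):
    """motions: hypothetical per-transition labels; paused_index: paused frame."""
    ops = set()
    before = motions[:paused_index]   # motion leading up to the paused frame
    after = motions[paused_index:]    # motion following the paused frame
    if "pan-right" in before or "pan-left" in after:
        ops.add("pan-left-button")    # space is known to extend to the left
    if "pan-right" in after or "pan-left" in before:
        ops.add("pan-right-button")   # space is known to extend to the right
    if "zoom-out" in after or "zoom-in" in before:
        ops.add("zoom-out-button")    # a wider view exists in the content
    if "zoom-in" in after or "zoom-out" in before:
        ops.add("zoom-in-button")     # a closer view exists in the content
    return ops
```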

Further, the display control part 25, in displaying the preview screen, performs modification, update and the like on the information registered in the content feature amount database 24 on the basis of corrections for the feature amounts input via the operation input part 26 and the like.

The operation input part 26 includes a mouse, a touch panel laminated on the display part 27, and the like, for example. The operation input part 26 supplies a signal in response to the user operation to the display control part 25. The display part 27 displays the preview screen generated by the display control part 25.

The feature amount extraction part 28 extracts the feature amount of the text or image information that is supplied from the display control part 25 and for which the user issues an instruction, and supplies the feature amount to the search part 29. The search part 29 searches the content feature amount database 24 for a feature amount similar to the feature amount from the feature amount extraction part 28 and supplies the search result to the display control part 25.

The space recognition part 30 recognizes the identity of an object in a video from the camera motion information, the image feature amount and the like among the feature amounts of the content registered in the content feature amount database 24, and thereby recognizes space information of a space in the video, space identity and the like. For example, space information such as a positional relationship and a magnitude (enlargement and reduction) relationship in a space between frames is recognized. The space recognition part 30 registers the recognized space information in the content feature amount database 24.
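
A minimal sketch of such an inter-frame relationship, assuming per-frame camera translation and zoom factors are already available (for example, derived from the flow-based extraction sketched earlier): accumulating them gives each frame a position and magnification in a common space, from which any two frames can be related. The data format and the helper names are hypothetical.

```python
def build_space_map(per_frame_motion):
    """per_frame_motion: list of (dx, dy, scale) per frame transition, where
    dx and dy are the camera's estimated translation (e.g. minus the mean
    optical flow) and scale is the zoom factor for that transition."""
    x = y = 0.0
    scale = 1.0
    space = [(x, y, scale)]          # frame 0 is taken as the origin
    for dx, dy, ds in per_frame_motion:
        scale *= ds                  # cumulative magnification
        x += dx / scale              # translation expressed in "world" units
        y += dy / scale
        space.append((x, y, scale))
    return space

def relation(space, i, j):
    """Positional and magnitude relationship of frame j relative to frame i."""
    xi, yi, si = space[i]
    xj, yj, sj = space[j]
    return {"offset": (xj - xi, yj - yi), "magnification": sj / si}
```

For instance, in a FIG. 8-style zoom-out, relation(space, 2, 3) would report a magnification below 1, meaning frame 3 shows a wider view than frame 2.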

Operation of Information Processing Apparatus

Subsequently, a description will be given of a content input process of the information processing apparatus 11 with reference to a flowchart in FIG. 2.

At step S11, the content input part 21 receives content from the outside not shown or the like. The content input part 21 supplies the received content to the feature amount extraction parts 23-1 to 23-3.

At step S12, the feature amount extraction parts 23-1 to 23-3 perform the image recognition, speech recognition, character recognition and the like on the content from the content input part 21 to extract each of the feature amounts including the image feature amount, speech feature amount and the like. Here, at step S12, the camera motion information is also extracted. At step S13, the feature amount extraction parts 23-1 to 23-3 register the extracted feature amounts of the content in the content feature amount database 24.

At step S14, the content input part 21 registers the received content in the content archive 22.

At step S15, the space recognition part 30 recognizes the identity of an object in a video from the camera motion information, the image feature amount and the like among the feature amounts of the content registered in the content feature amount database 24, and thereby recognizes a space in the video.

At step S16, the space recognition part 30 registers the recognized space information in the content feature amount database 24.
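
The following schematic sketch only mirrors the order of steps S11 to S16 described above; the part objects and their methods are hypothetical stand-ins, not an actual implementation of the apparatus.

```python
def content_input_process(content, extraction_parts, content_archive,
                          feature_db, space_recognition_part):
    # S11: received content is handed to each feature amount extraction part.
    # S12/S13: feature amounts (including camera motion) are extracted and registered.
    for part in extraction_parts:
        features = part.extract(content)
        feature_db.register(content.id, part.name, features)
    # S14: the content itself is registered in the content archive.
    content_archive.register(content)
    # S15/S16: the space between frames is recognized and the result registered.
    space_info = space_recognition_part.recognize(feature_db, content.id)
    feature_db.register(content.id, "space", space_info)
```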

A description will be given of a preview display process of the content which is carried out by use of the content and content feature amount registered as described above, with reference to a flowchart in FIG. 3.

The user operates the operation input part 26 to select content to be previewed. The information of the content selected by the user is supplied via the operation input part 26 to the display control part 25.

At step S31, the display control part 25 selects the content according to the information from the operation input part 26. At step S32, the display control part 25 acquires the content selected at step S31 from the content archive 22.

At step S33, the display control part 25 acquires the feature amount of the content selected at step S31 from the content feature amount database 24.

At step S34, the display control part 25 displays a preview screen. In other words, the display control part 25 generates the preview screen in which the information concerning the various feature amounts is displayed along the time line on the basis of the acquired content and the acquired content feature amount. The display control part 25 controls the display part 27 to display the generated preview screen (preview screen 51 shown in FIG. 5 described later). Here, what is displayed along the time line is not only feature amount information but also information concerning the feature amounts. The information concerning a feature amount includes the feature amount information itself, information obtained by use of the feature amount, or a result retrieved by use of the feature amount.

At step S35, the display control part 25 determines whether or not an instruction for preview reproduction is issued on the basis of an operation signal from the operation input part 26. If it is determined at step S35 that the instruction for the preview reproduction is issued, the display control part 25 carries out the preview reproduction process at step S36. This preview reproduction process is described later with reference to FIG. 4; in the process at step S36, an image corresponding to the operation on the content is displayed on the preview screen (preview screen 51 shown in FIG. 5 described later), and a line indicating a reproducing position in the time line is moved to a position of the displayed image and displayed. The operation on the content includes an operation for carrying out preview reproduction, pause, fast-forward, and fast-rewind, for example.

If it is determined at step S35 that the instruction for the preview reproduction is not issued, step S36 is skipped and the process proceeds to step S37. At step S37, the display control part 25 determines whether or not to end the display of the preview screen. If the user issues an instruction for the end via the operation input part 26, it is determined at step S37 to end the preview screen and the display of the preview screen ends.

On the other hand, if it is determined at step S37 not to end the display of the preview screen, the process returns to step S35 and step S35 and the subsequent steps are repeated.

Subsequently, a description will be given of the preview reproduction process at step S36 in FIG. 3 with reference to a flowchart in FIG. 4.

At step S51, the display control part 25 carries out preview reproduction. This allows the reproduced moving picture to be displayed in the preview display section 61 described later in FIG. 5.

At step S52, the display control part 25 changes the reproducing position display in the time line to a position of the image being reproduced. In other words, a line indicating the reproducing position in the time line is moved to the position of the image being reproduced.

At step S53, the display control part 25 determines whether or not an instruction for pause is issued on the basis of an operation signal from the operation input part 26. If it is determined at step S53 that the instruction for the pause is issued, the process proceeds to step S54.

The display control part 25, at step S54, pauses the moving picture being reproduced. Then, at step S55, if the viewpoint or angle of the camera can be moved from the paused image, the display control part 25 displays the GUI such as a button for enabling the movement (operation) of the viewpoint or angle of the camera on the basis of the camera motion information and the space information of the content feature amount database 24.

At step S56, the display control part 25 determines whether or not the camera angle is operated by the user on the basis of an operation signal from the operation input part 26. At step S56, if the button is depressed by the user operation on the operation input part 26, the camera angle is determined to be operated by the user, and the process proceeds to step S57. At step S57, the display control part 25 switches the display to an image of a different angle corresponding to the operated button.

Then, the display control part 25, at step S58, changes the reproducing position display in the time line to a position of the display-switched image. In other words, a line indicating the reproducing position in the time line is moved to the position of the display-switched image, and the process proceeds to step S62.
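
A hedged sketch of this pause handling (steps S53 to S58): on pause, only the camera-motion buttons that the content can actually honour are shown (for example, the set returned by the hypothetical available_operations helper sketched earlier); when an angle button is pressed, the displayed frame is switched and the time-line marker follows it. The player, GUI, and timeline objects are assumptions, not parts of the disclosed apparatus.

```python
def on_pause(player, gui, valid_ops):
    player.pause()                             # S54: pause the moving picture
    for op in valid_ops:                       # S55: superimpose the GUI buttons
        gui.show_button(op)

def on_angle_button(player, timeline, op):
    frame = player.current_frame_index
    # S57 (simplified): a rightward pan step maps to the next frame and a
    # leftward pan step to the previous one, as in the FIG. 6 example.
    target = frame + 1 if op == "pan-right-button" else frame - 1
    player.show_frame(target)                  # switch to the image of the other angle
    timeline.move_marker_to(target)            # S58: move the reproducing-position line
```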

On the other hand, if it is determined at step S53 that the instruction for the pause is not issued, the process proceeds to step S59. At step S59, the display control part 25 determines whether or not another operation (e.g., fast-rewind or fast-forward) is made by the user on the basis of an operation signal from the operation input part 26. If it is determined at step S59 that another operation is made by the user, the process proceeds to step S60.

The display control part 25, at step S60, carries out a reproduction process corresponding to the operation. For example, if an instruction for fast-forward is issued, the operation of fast-forward for the preview image or the like is made. Then, at step S61, the display control part 25 changes the reproducing position display in the time line to a position of the image being displayed. In other words, a line indicating the reproducing position in the time line (reproducing position display) is moved to the position of the image being displayed, and the process proceeds to step S62.

If it is determined at step S59 that another operation is not made, the process proceeds to step S62.

At step S62, the display control part 25 determines whether or not the content is reproduced to the end. If it is determined at step S62 that the content is reproduced to the end, the preview reproduction process ends. On the other hand, if it is determined at step S62 that the content is not reproduced to the end, the process returns to step S51 and step S51 and the subsequent steps are repeated.

Example of Preview Screen

FIG. 5 shows an example of the preview screen.

An example in FIG. 5 shows the preview screen 51 described at step S34 in FIG. 3 or the like, for example.

The preview screen 51 includes the preview display section 61 in which a moving picture of the content can be previewed, and a time line display section 62 which is located below the preview display section 61 and displayed by selecting an upper left tab.

The preview display section 61, in response to the user operation on an operation button provided immediately below the preview display section 61, reproduces and previews the moving picture of the content. The operation buttons include a fast-rewind button 72, skip-to-start button 73, reproduction button 74, skip-to-end button 75, fast-forward button 76, and stop button 77. Here, the reproduction button 74 is a reproduction button while reproduction of the moving picture of the content is stopped, but becomes a pause button while the moving picture of the content is being reproduced. The preview display section 61 displays a box 71 for selecting a face in the displayed content which undergoes facial recognition in a face image display section 85 described later.

The time line display section 62 displays the information concerning a plurality of feature amounts extracted by the feature amount extraction parts 23-1 to 23-3 in FIG. 1 along the time line. Moreover, a line 63 indicating a reproducing position of an image (frame) currently displayed in the preview display section 61 is provided on the time line, and the user can grasp the reproducing position of the content on the time line by taking a look at the line 63. A display position of this line 63 is changed in response to the image displayed in the preview display section 61.

Further, displayed on the right side of the time line display section 62 is a feature amount list 64 which enables selection of display or non-display on the time line display section 62. The user can check or uncheck a box arranged on the left side of the feature amount list 64 to select the display or non-display of the information concerning each feature amount and display only information concerning the desired feature amounts.

Note that, in the example in FIG. 5, only the fourth box from the top, “Relevance”, in the feature amount list 64 is unchecked. That is, the time line display section 62 in FIG. 5 does not display an importance display section which is to be displayed by checking “Relevance”.

Further, a digest generating display section 65 for generating a digest moving picture and the like is actually provided at the same position as the time line display section 62, but is not shown in the example in FIG. 5. By selecting a tab provided at the upper left of those, the digest generating display section 65 can be displayed in place of the time line display section 62.

The time line display section 62 includes a scene change image display section 81, speech waveform display section 82, text search result display section 83, image search result display section 84, face image display section 85, object image display section 86, human speech region display section 87, and camera motion information display section 88 in this order from the top. Each of them is a display section for displaying the information concerning a feature amount.

The scene change image display section 81 is displayed in the time line display section 62 by checking “Thumbnail” in the feature amount list 64. In the scene change image display section 81, a thumbnail image of a head frame image for each scene found by scene change is displayed on the time line as one of the feature amounts. Note that a scene head image is referred to as a scene change image below.

The speech waveform display section 82 is displayed in the time line display section 62 by checking “Wave form” in the feature amount list 64. In the speech waveform display section 82, a speech waveform of the content is displayed on the time line as one of the feature amounts.

The text search result display section 83 is displayed in the time line display section 62 by checking “Keyword Spotting” in the feature amount list 64. In the text search result display section 83, displayed is a result of searching the content feature amount database 24 for the text (“president” in the case of the example in FIG. 5) the user inputs by operating the operation input part 26, on the basis of the feature amounts from the speech recognition or character recognition.

The image search result display section 84 is displayed in the time line display section 62 by checking “Image Spotting” in the feature amount list 64. In the image search result display section 84, displayed is (a thumbnail image of) a result of searching the content feature amount database 24 for a scene similar to the image the user selects by operating the operation input part 26, on the basis of the feature amount from the image recognition.

The face image display section 85 is displayed in the time line display section 62 by checking “Face” in the feature amount list 64. In the face image display section 85, displayed is, from the content feature amount database 24, (a thumbnail image of) a feature amount similar to the feature amount from facial recognition which is obtained by recognizing a face selected by the box 71 in the preview display section 61.

The object image display section 86 is displayed in the time line display section 62 by checking “Capitol Hill” in the feature amount list 64. Here, in the example in FIG. 5, “Capitol Hill” is an example of an object, but an object is not limited to “Capitol Hill” and can be designated by the user. In the object image display section 86, displayed is (a thumbnail image of) a result of searching the content feature amount database 24 on the basis of the feature amount from recognition of an object (Capitol Hill in the case of FIG. 5) designated by the user.

Note that the example is shown in which the face image and the object image are separately displayed, but the face is one of the objects. The image displayed in the face image display section 85 and the object image display section 86 may be an image (thumbnail image) obtained by trimming an extraction object from an original image.

The human speech region display section 87 is displayed in the time line display section 62 by checking “Human Voice” in the feature amount list 64. In the human speech region display section 87, displayed is a human speech region, music region or the like found by the feature amount from the speech recognition. Here, the human speech region display section 87 may display, as shown in FIG. 5, not only a region in which a human speaks but also a mark according to the sex or age of the speaker.

The camera motion information display section 88 is displayed in the time line display section 62 by checking “Camera Motion” in the feature amount list 64. In the camera motion information display section 88, displayed is a region having the camera motion information such as pan, tilt, zoom-in, zoom-out or the like which is the feature amount from the camera motion recognition.

In the preview screen 51, various feature amounts, such as the feature amounts described above as the examples, which can be extracted from the content, and the information obtained using those feature amounts, are displayed along the time line.

Here, in the present technology, the GUI using particularly the camera motion information is displayed in the preview display section 61 in the preview screen 51 described above with reference to FIG. 5, and the operation on the screen enables operations of the camera angle and zoom-in/zoom-out for the video. A description thereof will be given specifically below.

Example of Camera Motion Information (Pan)

FIG. 6 is a diagram showing an example of a scene with a camera being panned rightward from t=0 to t=5.

For example, in the case where reproduction is started from an image at t=0 and paused at a timing of t=3, the space recognition part 30 can recognize that a space extends on the right and left of the image at t=3 owing to the camera motion information that the camera is panned rightward from t=0 to t=5.

Therefore, the display control part 25, in the preview display section 61 paused at t=3, as shown in FIG. 7, superimposes a left arrow button 101 for panning leftward and a right arrow button 102 for panning rightward as the GUI on the image at t=3 in FIG. 6. In other words, the left arrow button 101 and the right arrow button 102 are the GUI enabling the camera angle to be switched to a different angle. Note that the GUI enabling the camera angle to be switched to a different angle is not limited to the right arrow button and left arrow button described above. Additionally, the shape of the GUI may not necessarily be a button.

When the user operates the operation input part 26 including a mouse and the like, for example, by clicking (depressing) the right arrow button 102, the camera angle is panned rightward to bring the reproducing position to t=4, and thus, the display in the preview display section 61 is switched from the image at t=3 to the image at t=4 in FIG. 6.

Alternatively, when the user operates the operation input part 26 to click (depress) the left arrow button 101, the camera angle is panned leftward to bring the reproducing position to t=2, and thus, the display in the preview display section 61 is switched from the image at t=3 to the image at t=2 in FIG. 6.

At this time, the similar operation can be made not only by clicking the button but also by indicating the right or left direction using a mouse operation such as a mouse gesture, or by indicating the right or left direction using a finger gesture on the touch panel laminated on the display part 27.

Here, these arrow buttons can be displayed even during reproduction of the moving picture, and for example, in the case where the left arrow button 101 is clicked at the moment of t=3 during reproduction of the moving picture, reverse reproduction may be performed from the image at t=3 to the image at t=0.
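
One possible way to resolve such a pan operation into a new reproducing position, sketched here under the assumption that a space map like the output of the earlier build_space_map sketch is available, is to pick the frame whose recognized horizontal position is the nearest one in the requested direction. In the FIG. 6 example this turns the pause at t=3 into t=4 for a right pan and t=2 for a left pan. The function and its conventions are hypothetical, not taken from the disclosure.

```python
def pan_target(space, paused, direction):
    """space: list of (x, y, scale) per frame; returns the frame index whose
    recognized horizontal position is nearest in the requested direction."""
    x0 = space[paused][0]
    if direction == "right":
        candidates = [i for i, (x, _, _) in enumerate(space) if x > x0]
        return min(candidates, key=lambda i: space[i][0], default=paused)
    else:
        candidates = [i for i, (x, _, _) in enumerate(space) if x < x0]
        return max(candidates, key=lambda i: space[i][0], default=paused)
```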

Example of Camera Motion Information (Zoom)

FIG. 8 is a diagram showing an example of a scene in which a camera zooms out from t=0 to t=4.

For example, in the case where reproduction is started from the image at t=0 and paused at a timing of t=2, the space recognition part 30 can recognize that a space extends around the image at t=2 owing to the camera motion information that the camera zooms out from t=0 to t=4. Here, the camera zooms in in the reverse direction (from t=4 to t=0).

Therefore, the display control part 25, in the preview display section 61 paused at t=2, as shown in A of FIG. 9, superimposes a plus mark button 111 and a minus mark button 112 as the GUI on a portion of the image at t=2. In other words, the plus mark button 111 and the minus mark button 112 are the GUI enabling the zoom-in/zoom-out operation.

Alternatively, the display control part 25, in the preview display section 61 paused at t=2, as shown in B of FIG. 9, superimposes outward arrow buttons 113-1 to 113-4 and inward arrow buttons 114-1 to 114-4 as the GUI on the image at t=2 in four directions. In other words, the outward arrow buttons 113-1 to 113-4 and the inward arrow buttons 114-1 to 114-4 are also the GUI enabling the zoom-out/zoom-in operation. Here, the GUI enabling the zoom-out/zoom-in operation is not limited to the plus mark button, minus mark button, inward arrow button and outward arrow button described above. Additionally, the shape of the GUI may not necessarily be a button.

In the example in A of FIG. 9, when the plus mark button 111 is clicked, the screen is zoomed in to bring the reproducing position to t=1, and thus, the display in the preview display section 61 is switched from the image at t=2 to the image at t=1 in FIG. 8.

On the other hand, in the example in A of FIG. 9, when the minus mark button 112 is clicked, the screen is zoomed out to bring the reproducing position to t=3, and thus, the display of the preview display section 61 is switched from the image at t=2 to the image at t=3 in FIG. 8.

Further, in the example in B of FIG. 9, when any of the inward arrow buttons 114-1 to 114-4 is clicked, the screen is zoomed in to bring the reproducing position to t=1, and thus, the display of the preview display section 61 is switched from the image at t=2 to the image at t=1 in FIG. 8.

On the other hand, in the example in B of FIG. 9, when any of the outward arrow buttons 113-1 to 113-4 is clicked, the screen is zoomed out to bring the reproducing position to t=3, and thus, the display of the preview display section 61 is switched from the image at t=2 to the image at t=3 in FIG. 8.

At this time, as in the case of panning, the similar operation can be made not only by clicking the button but also by indicating the direction of zoom-in or zoom-out using a mouse operation such as a mouse wheel. Moreover, the similar operation can be made by indicating the direction of zoom-in or zoom-out using a finger gesture on the touch panel laminated on the display part 27.

Here, these arrow buttons can be displayed even during reproduction of the moving picture, and for example, in the case where the plus mark button 111 is clicked at the moment of t=2 during reproduction of the moving picture, reverse reproduction may be performed from the image at t=2 to the image at t=0.

Further, in the example in B of FIG. 9, when each of the arrow buttons is clicked, the enlargement or reduction can be made only in the direction of the clicked arrow button. For example, when the outward arrow button 113-4, which is located at the upper right and points in the upper right direction, is clicked, the image at t=3 in FIG. 8 is trimmed such that an image as shown in FIG. 10 is displayed in the preview display section 61.

In other words, the image in FIG. 10 is an image obtained by enlarging the image at t=2 only in the upper right direction and is generated by trimming the image at t=3.

Note that in this case, the reproducing position is t=3, but the display is different from the usual reproducing position display since the image displayed in the preview display section 61 does not coincide with the actual image at t=3. Therefore, a marker indicating the reproducing position may be changed in color and displayed.
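
As a rough sketch only: such a directional enlargement can be produced by cropping the wider frame so that, relative to the current view, space is revealed only toward the requested corner. The crop ratio and the assumption that a frame is a NumPy-style array indexed as [row, column] are illustrative choices, not details of the disclosure.

```python
def trim_for_directional_zoom(wide_frame, reveal="upper-right", keep=0.75):
    """Crop the wider frame (e.g. t=3 in FIG. 8) so that only the `reveal`
    corner is newly shown; `keep` is the fraction of width/height retained."""
    h, w = wide_frame.shape[:2]
    ch, cw = int(h * keep), int(w * keep)
    if reveal == "upper-right":
        crop = wide_frame[0:ch, w - cw:w]      # keep the top and the right edge
    elif reveal == "upper-left":
        crop = wide_frame[0:ch, 0:cw]
    elif reveal == "lower-right":
        crop = wide_frame[h - ch:h, w - cw:w]
    else:                                      # "lower-left"
        crop = wide_frame[h - ch:h, 0:cw]
    return crop
```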

Similarly, the space recognition part 30 can recognize that the space in the vertical and horizontal directions of the image at t=2 also extends owing to the camera motion information that the camera zooms out from t=0 to t=4.

Therefore, the display control part 25 can also, as shown in A of FIG. 11, superimpose arrow buttons 121 to 124 for enabling a pan/tilt operation in the vertical and horizontal directions on the image in the preview display section 61.

When any of the arrow buttons 121 to 124 is clicked, the pan/tilt can be made in the direction of the clicked arrow button. Specifically, when the upward arrow button 121 is clicked, for example, the display control part 25 trims the image at t=3 in FIG. 8 to display an image as shown in B of FIG. 11 in the preview display section 61.

Note that in this case also, the reproducing position is t=3, but the display is different from the usual reproducing position display since the image displayed in the preview display section 61 does not coincide with the actual image at t=3. Therefore, the marker indicating the reproducing position may be changed in color and displayed.

Example of Camera Motion Information (Pan/Tilt Repetition)

FIG. 12 is a diagram showing an example of a scene shot with repeated pan/tilt by a camera from t=0 to t=8.

In the case of a video in FIG. 12, the space recognition part 30 can recognize the identity of an object in the video from the camera motion information and image feature amount in the content feature amount database 24, and can thereby recognize that a space as shown in A of FIG. 13 can be reproduced. Moreover, at the same time, the space recognition part 30 can recognize that the camera was moved in the numerical order of the numbered arrows shown in B of FIG. 13 to shoot the images in FIG. 12.

In other words, the space recognition part 30 recognizes that images 151 to 159 in B of FIG. 13 correspond to the images from t=0 to t=8 in FIG. 12, respectively.

For example, in the case where reproduction is started from an image at t=0 and paused at a timing of t=4, the space recognition part 30 can recognize that a space extends around the image at t=4.

Therefore, the display control part 25, in the preview display section 61 paused at t=4, as shown in FIG. 14, superimposes a plus mark button 171 and a minus mark button 172 for zoom-in and zoom-out as the GUI on the image at t=4. Further, the display control part 25, as shown in FIG. 14, superimposes arrow buttons 181 to 184 for pan/tilt in the vertical and horizontal directions as the GUI on the image at t=4 in the preview display section 61.

Note that the buttons for zoom-in and zoom-out may be the arrow buttons described above in B of FIG. 9.

Here, as for the pan in the right and left directions, the frames are continuous in the content. Accordingly, for example, when the leftward arrow button 182 is clicked, the camera angle is panned leftward to bring the reproducing position to t=3, and thus, the display in the preview display section 61 is switched from the image at t=4 to the image at t=3 in FIG. 12.

On the other hand, as for the tilt in the up and down directions, and for zoom-in or zoom-out, the frames are not continuous in the content. Accordingly, for example, when the upward arrow button 181 is clicked, the display control part 25 trims the image 152, which corresponds to an upper portion of the image at t=4 in FIG. 12, out of the image with the space reproduced in B of FIG. 13, and displays it in the preview display section 61.

In this case, the displayed image is an image reproduced from the image at t=4 and other images, and thus, the reproducing position display indicates the timing of the original material used for generating the reproduced image. That is, in the case of reproducing from the image at t=4 and the image at t=1, the timings of t=4 and t=1 are indicated as the reproducing position. In this case, since the reproducing position is different from the usual reproducing position display, the marker indicating the reproducing position may be changed in color.
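
A hypothetical sketch of that bookkeeping, assuming a space map of the kind sketched earlier: collect the indices of the frames whose recognized position lies directly above the paused frame, and report them together with the paused frame itself, so the reproducing-position marker can be drawn (in a different color) at each source timing. The tolerance and coordinate conventions are assumptions.

```python
def tilt_up_sources(space, paused, view_h, tol):
    """Return the indices of the frames used for an upward-tilt composite:
    frames positioned roughly one view height above the paused frame,
    plus the paused frame itself (e.g. [1, 4] for the FIG. 12 example)."""
    x0, y0, _ = space[paused]
    above = [i for i, (x, y, _) in enumerate(space)
             if abs(x - x0) < tol and y0 - 1.5 * view_h < y < y0]
    return sorted(set(above) | {paused})
```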

In this case, the similar operation can be made not only by clicking the button but also by indicating the directions of pan/tilt and zoom-in or zoom-out using a mouse operation such as a mouse gesture or mouse wheel. Moreover, the similar operation can be made by indicating the direction of zoom-in or zoom-out using a finger gesture on the touch panel laminated on the display part 27.

Additionally, these buttons can be displayed even during reproduction of the moving picture, and for example, in the case where the left arrow button 182 is clicked at the moment of t=5 during reproduction of the moving picture, reverse reproduction may be performed from the image at t=5 to the image at t=3.

As described above, in the preview screen for checking the substance of content, the camera angle and zoom-in or zoom-out can be operated in terms of the space extending beyond the preview image, without regard to the reproducing timing or reproducing direction of the content. Therefore, operability for grasping the substance of content can be improved.

The series of processes described above can be executed by hardware but can also be executed by software. When the series of processes is executed by software, a program that constructs such software is installed into a computer. Here, the expression “computer” includes a computer in which dedicated hardware is incorporated and a general-purpose personal computer or the like that is capable of executing various functions when various programs are installed.

<2. Second embodiment>

[Configuration example of computer]

FIG. 15 illustrates a configuration example of hardware of a computer that executes the above series of processes by programs.

In a computer 300, a central processing unit (CPU) 301, a read only memory (ROM) 302 and a random access memory (RAM) 303 are mutually connected by a bus 304.

An input/output interface 305 is also connected to the bus 304. An input unit 306, an output unit 307, a storage unit 308, a communication unit 309, and a drive 310 are connected to the input/output interface 305.

The input unit 306 is configured from a keyboard, a mouse, a microphone or the like. The output unit 307 is configured from a display, a speaker or the like. The storage unit 308 is configured from a hard disk, a non-volatile memory or the like. The communication unit 309 is configured from a network interface or the like. The drive 310 drives a removable recording medium 311 such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory or the like.

In the computer configured as described above, the CPU 301 loads a program that is stored, for example, in the storage unit 308 onto the RAM 303 via the input/output interface 305 and the bus 304, and executes the program. Thus, the above-described series of processing is performed.

As one example, the program executed by the computer (the CPU 301) may be provided by being recorded on the removable recording medium 311 as a packaged medium or the like. The program can also be provided via a wired or wireless transfer medium, such as a local area network, the Internet, or a digital satellite broadcast.

In the computer, by loading the removable recording medium 311 into the drive 310, the program can be installed into the storage unit 308 via the input/output interface 305. It is also possible to receive the program from a wired or wireless transfer medium using the communication unit 309 and install the program into the storage unit 308. As another alternative, the program can be installed in advance into the ROM 302 or the storage unit 308.

It should be noted that the program executed by a computer may be a program that is processed in time series according to the sequence described in this specification or a program that is processed in parallel or at necessary timing such as upon calling.

In the present disclosure, steps of describing the above series of processes may include processing performed in time-series according to the description order and processing not processed in time-series but performed in parallel or individually.

The embodiment of the present disclosure is not limited to the above-described embodiment. It should be understood by those skilled in the art that various modifications, combinations, sub-combinations and alterations may occur depending on design requirements and other factors insofar as they are within the scope of the appended claims or the equivalents thereof.

For example, the present technology can adopt a configuration of cloud computing in which one function is shared and processed jointly by a plurality of apparatuses through a network.

Further, each step described in the above-mentioned flowcharts can be executed by one apparatus or shared among a plurality of apparatuses.

In addition, in the case where a plurality of processes is included in one step, the plurality of processes included in this one step can be executed by one apparatus or shared among a plurality of apparatuses.

Further, an element described as a single device (or processing unit) above may be divided to be configured as a plurality of devices (or processing units). On the contrary, elements described as a plurality of devices (or processing units) above may be configured collectively as a single device (or processing unit). Further, an element other than those described above may be added to each device (or processing unit). Furthermore, a part of an element of a given device (or processing unit) may be included in an element of another device (or another processing unit) as long as the configuration or operation of the system as a whole is substantially the same. In other words, an embodiment of the disclosure is not limited to the embodiments described above, and various changes and modifications may be made without departing from the scope of the technology.

Although the preferred embodiments of the present disclosure have been described in detail with reference to the appended drawings, the present disclosure is not limited thereto. It is obvious to those skilled in the art that various modifications or variations are possible insofar as they are within the technical scope of the appended claims or the equivalents thereof. It should be understood that such modifications or variations are also within the technical scope of the present disclosure.

Additionally, the present technology may also be configured as below.

-   (1) An information processing apparatus including:

a plurality of feature amount extraction parts configured to extract, from content, a plurality of feature amounts containing information concerning a camera motion in a frame; and

a display control part configured to display a GUI corresponding to an operation concerning the camera motion in the frame on the frame in the content on the basis of the information concerning the camera motion extracted by the feature amount extraction parts.

-   (2) The information processing apparatus according to (1), further including:

a space recognition part configured to recognize a relationship in a space between frames using the information concerning the camera motion and an image feature amount of the plurality of feature amounts,

wherein the display control part displays the GUI corresponding to the operation concerning the camera motion in the frame on the frame in the content on the basis of the information concerning the camera motion extracted by the feature amount extraction parts and the relationship in the space between the frames recognized by the space recognition part.

-   (3) The information processing apparatus according to (1) or (2), wherein the GUI corresponding to the operation concerning the camera motion is a GUI enabling a camera angle to be switched to another angle.

-   (4) The information processing apparatus according to any one of (1) to (3), wherein the display control part switches a display of the frame to a display of a frame of an angle corresponding to an operation on the GUI by a user.

-   (5) The information processing apparatus according to (1) or (2), wherein the GUI corresponding to the operation concerning the camera motion is a GUI enabling a zoom-in or zoom-out operation.

-   (6) The information processing apparatus according to (1), (2), or (5), wherein the display control part switches a display of the frame to a display of a frame of zoom-in or zoom-out corresponding to an operation on the GUI by a user.

-   (7) An information processing method including:

extracting, by an information processing apparatus, from content, a plurality of feature amounts containing information concerning a camera motion in a frame; and

displaying, by the information processing apparatus, a GUI corresponding to an operation concerning the camera motion in the frame on the frame in the content on the basis of the extracted information concerning the camera motion.

-   (8) A program causing a computer to function as:

a plurality of feature amount extraction parts configured to extract, from content, a plurality of feature amounts containing information concerning a camera motion in a frame; and

a display control part configured to display a GUI corresponding to an operation concerning the camera motion in the frame on the frame in the content on the basis of the information concerning the camera motion extracted by the feature amount extraction parts.

1. An information processing apparatus comprising: a plurality of feature amount extraction parts configured to extract, from content, a plurality of feature amounts containing information concerning a camera motion in a frame; and a display control part configured to display a GUI corresponding to an operation concerning the camera motion in the frame on the frame in the content on the basis of the information concerning the camera motion extracted by the feature amount extraction parts. 2-8: (canceled)